The paper describes the key concepts of a word spotting system for Russian based on large vocabulary continuous speech\nrecognition. Key algorithms and system settings are described, including the pronunciation variation algorithm, and the\nexperimental results on the real-life telecom data are provided. The description of system architecture and the user interface\nis provided. The system is based on CMU Sphinx open-source speech recognition platform and on the linguistic models and\nalgorithms developed by Speech Drive LLC. The effective combination of baseline statistic methods, real-world training data, and\nthe intensive use of linguistic knowledge led to a quality result applicable to industrial use.
Loading....